NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

DPI: Ensuring Strict Differential Privacy for Infinite Data Streaming

https://doi.org/10.1109/SP54263.2024.00124

Feng, Shuya; Mohammady, Meisam; Wang, Han; Li, Xiaochen; Qin, Zhan; Hong, Yuan (May 2024, IEEE)

Full Text Available
Local Differentially Private Heavy Hitter Detection in Data Streams with Bounded Memory

https://doi.org/10.1145/3639285

Li, Xiaochen; Liu, Weiran; Lou, Jian; Hong, Yuan; Zhang, Lei; Qin, Zhan; Ren, Kui (March 2024, Proceedings of the ACM on Management of Data)

Top-k frequent items detection is a fundamental task in data stream mining. Many promising solutions are proposed to improve memory efficiency while still maintaining high accuracy for detecting the Top-k items. Despite the memory efficiency concern, the users could suffer from privacy loss if participating in the task without proper protection, since their contributed local data streams may continually leak sensitive individual information. However, most existing works solely focus on addressing either the memory-efficiency problem or the privacy concerns but seldom jointly, which cannot achieve a satisfactory tradeoff between memory efficiency, privacy protection, and detection accuracy. In this paper, we present a novel framework HG-LDP to achieve accurate Top-k item detection at bounded memory expense, while providing rigorous local differential privacy (LDP) protection. Specifically, we identify two key challenges naturally arising in the task, which reveal that directly applying existing LDP techniques will lead to an inferior accuracy-privacy-memory efficiency tradeoff. Therefore, we instantiate three advanced schemes under the framework by designing novel LDP randomization methods, which address the hurdles caused by the large size of the item domain and by the limited space of the memory. We conduct comprehensive experiments on both synthetic and real-world datasets to show that the proposed advanced schemes achieve a superior accuracy-privacy-memory efficiency tradeoff, saving 2300× memory over baseline methods when the item domain size is 41,270. Our code is anonymously open-sourced via the link.
more » « less
Full Text Available
PrivacyAsst: Safeguarding User Privacy in Tool-Using Large Language Model Agents

https://doi.org/10.1109/TDSC.2024.3372777

Zhang, Xinyu; Xu, Huiyu; Ba, Zhongjie; Wang, Zhibo; Hong, Yuan; Liu, Jian; Qin, Zhan; Ren, Kui (January 2024, IEEE Transactions on Dependable and Secure Computing)

Full Text Available
MUter: Machine Unlearning on Adversarial Training Models

Liu, Junxu; Xue Mingsheng; Lou Jian; Zhang, Xiaoyu; Xiong, Li; Qin, Zhan (October 2023, International Conference on Computer Vision)

Full Text Available
ShapleyFL: Robust Federated Learning Based on Shapley Value

https://doi.org/10.1145/3580305.3599500

Sun, Qiheng; Li, Xiang; Zhang, Jiayao; Xiong, Li; Liu, Weiran; Liu, Jinfei; Qin, Zhan; Ren, Kui (August 2023, KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining)

Full Text Available
Equitable Data Valuation Meets the Right to Be Forgotten in Model Markets

https://doi.org/10.14778/3611479.3611531

Xia, Haocheng; Liu, Jinfei; Lou, Jian; Qin, Zhan; Ren, Kui; Cao, Yang; Xiong, Li (July 2023, Proceedings of the VLDB Endowment)

The increasing demand for data-driven machine learning (ML) models has led to the emergence of model markets, where a broker collects personal data from data owners to produce high-usability ML models. To incentivize data owners to share their data, the broker needs to price data appropriately while protecting their privacy. For equitable data valuation , which is crucial in data pricing, Shapley value has become the most prevalent technique because it satisfies all four desirable properties in fairness: balance, symmetry, zero element, and additivity. For the right to be forgotten , which is stipulated by many data privacy protection laws to allow data owners to unlearn their data from trained models, the sharded structure in ML model training has become a de facto standard to reduce the cost of future unlearning by avoiding retraining the entire model from scratch. In this paper, we explore how the sharded structure for the right to be forgotten affects Shapley value for equitable data valuation in model markets. To adapt Shapley value for the sharded structure, we propose S-Shapley value, a sharded structure-based Shapley value, which satisfies four desirable properties for data valuation. Since we prove that computing S-Shapley value is #P-complete, two sampling-based methods are developed to approximate S-Shapley value. Furthermore, to efficiently update valuation results after data owners unlearn their data, we present two delta-based algorithms that estimate the change of data value instead of the data value itself. Experimental results demonstrate the efficiency and effectiveness of the proposed algorithms.
more » « less
Full Text Available
L-SRR: Local Differential Privacy for Location-Based Services with Staircase Randomized Response

https://doi.org/10.1145/3548606.3560636

Wang, Han; Hong, Hanbin; Xiong, Li; Qin, Zhan; Hong, Yuan (November 2022, In Proceedings of the 29th ACM Conference on Computer and Communications Security (CCS'22))

Full Text Available
OpBoost: a vertical federated tree boosting framework based on order-preserving desensitization

https://doi.org/10.14778/3565816.3565823

Li, Xiaochen; Hu, Yuke; Liu, Weiran; Feng, Hanwen; Peng, Li; Hong, Yuan; Ren, Kui; Qin, Zhan (October 2022, Proceedings of the VLDB Endowment)

Vertical Federated Learning (FL) is a new paradigm that enables users with non-overlapping attributes of the same data samples to jointly train a model without directly sharing the raw data. Nevertheless, recent works show that it's still not sufficient to prevent privacy leakage from the training process or the trained model. This paper focuses on studying the privacy-preserving tree boosting algorithms under the vertical FL. The existing solutions based on cryptography involve heavy computation and communication overhead and are vulnerable to inference attacks. Although the solution based on Local Differential Privacy (LDP) addresses the above problems, it leads to the low accuracy of the trained model. This paper explores to improve the accuracy of the widely deployed tree boosting algorithms satisfying differential privacy under vertical FL. Specifically, we introduce a framework called OpBoost. Three order-preserving desensitization algorithms satisfying a variant of LDP called distance-based LDP (dLDP) are designed to desensitize the training data. In particular, we optimize the dLDP definition and study efficient sampling distributions to further improve the accuracy and efficiency of the proposed algorithms. The proposed algorithms provide a trade-off between the privacy of pairs with large distance and the utility of desensitized values. Comprehensive evaluations show that OpBoost has a better performance on prediction accuracy of trained models compared with existing LDP approaches on reasonable settings. Our code is open source.
more » « less
Full Text Available
PrivLBS: Local Differential Privacy for Location-Based Services with Staircase Randomized Response

Wang, Han; Hong, Hanbin; Xiong, Li; Qin, Zhan; Hong, Yuan (January 2022, Proceedings of the ACM Conference on Computer and Communications Security)

Full Text Available
Towards Differentially Private Truth Discovery for Crowd Sensing Systems

https://doi.org/10.1109/ICDCS47774.2020.00037

Li, Yaliang; Xiao, Houping; Qin, Zhan; Miao, Chenglin; Su, Lu; Gao, Jing; Ren, Kui; Ding, Bolin (November 2020, The 40th International Conference on Distributed Computing Systems (ICDCS 2020))
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records